Recognition of Bangla compound characters using structural decomposition
Abstract
In this paper we propose a novel character recognition method for Bangla compound characters. Accurate recognition of compound characters is a difficult problem due to their complex shapes. Our strategy is to decompose a compound character into skeletal segments. The compound character is then recognized by extracting the convex shape primitives and using a template matching scheme. The novelty of our approach lies in the formulation of appropriate rules of character decomposition for segmenting the character skeleton into stroke segments and then grouping them for extraction of meaningful shape components. Our technique is applicable to both printed and handwritten characters. The proposed method performs well for complex-shaped compound characters, which confused existing methods. © 2013 Elsevier Ltd. All rights reserved.
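The abstract does not state the decomposition rules themselves, so the following is only a minimal sketch of one generic way to cut a character skeleton into stroke segments. It assumes the skeleton is a set of (row, col) pixels with 8-connectivity and treats any pixel with three or more neighbours as a junction. The function names `neighbours` and `split_skeleton` are hypothetical, and this junction heuristic is an illustrative stand-in, not the paper's rule set.

```python
from collections import deque

# Offsets of the 8-connected neighbourhood.
OFFSETS = [(-1, -1), (-1, 0), (-1, 1), (0, -1), (0, 1), (1, -1), (1, 0), (1, 1)]


def neighbours(pixel, pixels):
    """Return the 8-connected neighbours of `pixel` that lie in `pixels`."""
    r, c = pixel
    return [(r + dr, c + dc) for dr, dc in OFFSETS if (r + dr, c + dc) in pixels]


def split_skeleton(skeleton):
    """Split a skeleton into stroke segments (illustrative heuristic only).

    Pixels with three or more neighbours are treated as junctions and removed.
    The connected components of what remains are returned as segments.
    With 8-connectivity, pixels next to a crossing are also flagged, so a
    junction is usually a small cluster rather than a single pixel.
    """
    junctions = {p for p in skeleton if len(neighbours(p, skeleton)) >= 3}
    remaining = skeleton - junctions
    segments, seen = [], set()
    for start in sorted(remaining):
        if start in seen:
            continue
        seen.add(start)
        queue, segment = deque([start]), []
        while queue:  # breadth-first walk over one connected component
            p = queue.popleft()
            segment.append(p)
            for q in neighbours(p, remaining):
                if q not in seen:
                    seen.add(q)
                    queue.append(q)
        segments.append(sorted(segment))
    return junctions, segments


# Toy input: a plus-shaped skeleton on a 7x7 grid (made-up data).
plus = {(3, c) for c in range(7)} | {(r, 3) for r in range(7)}
junctions, segments = split_skeleton(plus)
for seg in segments:
    print(seg)
```

On the toy plus shape this yields four arm segments around a junction cluster. Grouping such segments into convex shape primitives and matching them against templates, as the abstract describes, would be separate later stages and is not attempted here.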
Similar resources
Handwritten Bangla Basic and Compound character recognition using MLP and SVM classifier
A novel approach for recognition of handwritten compound Bangla characters, along with the Basic characters of Bangla alphabet, is presented here. Compared to English like Roman script, one of the major stumbling blocks in Optical Character Recognition (OCR) of handwritten Bangla script is the large number of complex shaped character classes of Bangla alphabet. In addition to 50 basic character...
Handwritten Bangla Compound character recognition: Potential challenges and probable solution
A novel technique is presented here for recognition of handwritten compound characters of Bangla alphabet. It advocates for incrementally expanding the number of learned character classes from more frequently occurred to less frequently occurred ones. The work is preceded by a survey for finding the frequencies of occurrences of all Bangla characters in the standard literature. One important fi...
BanglaLekha-Isolated: A Comprehensive Bangla Handwritten Character Dataset
Bangla handwriting recognition is becoming a very important issue nowadays. It is potentially a very important task specially for Bangla speaking population of Bangladesh and West Bengal. By keeping that in our mind we are introducing a comprehensive Bangla handwritten character dataset named BanglaLekha-Isolated. This dataset contains Bangla handwritten numerals, basic characters and compound ...
BanglaLekha-Isolated: A multi-purpose comprehensive dataset of Handwritten Bangla Isolated characters
BanglaLekha-Isolated, a Bangla handwritten isolated character dataset is presented in this article. This dataset contains 84 different characters comprising of 50 Bangla basic characters, 10 Bangla numerals and 24 selected compound characters. 2000 handwriting samples for each of the 84 characters were collected, digitized and pre-processed. After discarding mistakes and scribbles, 1,66,105 han...
A New Approach to Bangla Text Extraction and Recognition From Textual Image
This paper presents a new approach to segment and recognize Printed Bangla Text using Characteristic functions and Hamming network. The main difficulties in printed Bangla text recognition are the separation of lines, words and individual characters. In this paper, a new algorithm has been proposed to detect and separate text lines, words and characters from printed Bangla text. The algorithm u...
Journal:
- Pattern Recognition
Volume 47
Publication date: 2014